This research paper presents parametrization of emotional speech using a pool of common features utilized in\nemotion recognition such as fundamental frequency, formants, energy, MFCC, PLP, and LPC coefficients. The pool is\nadditionally expanded by perceptual coefficients such as BFCC, HFCC, RPLP, and RASTA PLP, which are used in speech\nrecognition, but not applied in emotion detection. The main contribution of this work is the comparison of the\naccuracy performance of emotion detection for each feature type based on the results provided by both k-NN and\nSVM algorithms with 10-fold cross-validation. Analysis was performed on two different Polish emotional speech\ndatabases: voice performances by professional actors in comparison with the author�s spontaneous speech.
Loading....